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Method And System for Video Transcoding 




BACKGROUND OF THE INVENTION 

FIELD OF THE INVENTION 

This invention relates to transcoders for converting a signal stream compressed 
by a first coding scheme to a new signal stream according to a second coding scheme. 
The invention is particularly suitable for transcoding compressed digital video streams, 
such as the MPEG video streams. 

With the rapid advances in video communication technologies, more and more 
applications are making use of digital video. It is expected that, in the near future, users 
will be able to access a wide variety of services that include video on demand, 
videoconferencing, videotelephony, etc. However, transmitting uncompressed digital 
video data over long distance requires high t)andwidth links and is expensive. 

Video compression techniques have been developed to reduce transmission 
bandwidth with acceptable degradation in the picture quality. While there have been a 
lot of research efforts in developing new video coding algorithms, currently the most 
efficient video coding algorithms employ motion compensated prediction and transform 
coding to reduce the temporal redundancy and exploit the spatial conrelation. The gain 
of motion compensated prediction relies on the assumption that pixel values in a video 
frame mostly resemble the neighboring pixel valves in an adjacent frame, and relies 
further on average, that transmitting the prediction errors is more efficient than 
transmitting the pixel valves. 

The motion compensation prediction step involves searching for a block of pixels 
in the previous reconstructed picture(s), which is the best match to the block being 
predicted in the current picture. The motion prediction error is then calculated based on 
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a selected distance measure and quantized along with the motion vector, which is the 
differences between the coordinators of the two blocks. The transform coding involves 
transfomiing a block of pixel values or a block of prediction errors into another 
representation or another set of values in which many coefficients are or near zeros so 
that greater compression efficiency can be achieved. The most widely used transform 
for coding is the block based Discrete Cosine Transform (DCT). The block DCT based 
video coding standards, such as MPEG-1, MPEG-2, H.263, and Motion JPEG are and 
will likely still be the major digital video coding techniques for various wired and wireless 
video services and applications. Conventionally, what is known as a macroblock is a 16 
by 16 block of pixels; a slice is a row of macroblocks. 

Recently, there is an increasing need for the service provider to have effective 
means that can efficiently transcode pre-compressed high bit rate video streams into 
'S' lower bit rate streams in real time to meet the different bandwidth requirements of 

various services or applications. One of the prior art approaches for video transcoding 
5 is shown in Figure 1 . First, a decoder decodes the video compressed according to a 
first coding scheme back to the uncompressed video signal, then, an encoder encodes 
the reconstructed video signal according to another coding scheme. Another prior art 
approach is shown in Figure 2, wherein the compressed video is completely or partially 
decoded and the motion predictk)n and compensation parameters used by the first 
coding scheme are extracted and reused in the second coding scheme. 

In order to provide more detailed background, reference may be made to 
[1] M. Nilsson, D. Monison, M. Granbari, "Video Transcoder with By-Pass Transfer of 
Extracted Motion Compensation Data," US Patent 5940130, Aug. 17, 1999. 
[2] Q. Zhu, "Device and Method for Digital Video Transcoding,' US patent 5870146, 
Feb. 9, 1999. 
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SUMMARY OF THE INVENTION 

A primary object of the present invention is to provide a system and method for 
achieving high perfomnance video transcoding; yet low cost video transcoding. 

According to one aspect of the invention, there is provided a transcoding system 
and methodology that comprises an estimator, a decoder, and an encoder. The 
estimator gathers the infomiation and estimates the signal characteristics of the future 
incoming pictures to be transcoded, including look ahead into the incoming 
compressed signal to estimate the signal complexities of the future pictures. The 
outcome of the estimator is provided to the encoder in order to devise a second coding 
scheme for encoding the output of the decoder. The decoder, which can be either a 
complete decoder or a partial decoder, completely or partially decodes the compressed 
video signal coded according to a first coding scheme. The output of the decoder is 
either a completely or partially reconstructed video signal, depending on whether a full 
or partial decoder is used. The reconstructed video signal is then fed to the encoder. 
The encoder encodes the reconstructed video signal guided by a second coding 
scheme devised on the basis of the information from the estimator. 



If motion compensated prediction is used, the estimator can also gather the 
motion vector and prediction and picture type infonnation during the decoding and pass 
the infonnation to the encoder. The encoder can either reuse the motion vectors or 
derive new motion compensated prediction. For example, if a logo is inserted during 
the transcoding, the encoder may derive some other motion vectors and motion 
prediction types for area affected by the logo insertion. 

The foregoing and still further objects and advantages of the present invention 
will be more apparent from the following detailed explanation of the preferred 
embodiments of the invention in connection with the accompanying drawings: 
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BRIEF DESCRIPTION OF THE DRAWINGS 



Figure 1 is a diagram of a prior art transcoder. 

Figure 2 shows another prior art transcoding scheme. 

Figure 3 is a schematic drawing of a transcoder with an estimator according to 
one embodiment of the invention. 

Figure 4 shows a schematic drawing of a transcoder with a lool<-ahead estimator 
according to a second eml)odiment of the invention. 

DETAILED DESCRIPTION OF THE INVENTION 

C3 

'3 As the present invention may be applied in connection with MPEG-2 video 

J transcoding, it is understood that the invention may also be applied to other coding 
Ce algorithms that share some features of the MPEG-2 standard, such as MPEG-1 . H.263, 
fy Motion JPEG. etc. 

f y Figure 3 is the diagram of a transcoding system 300 according to one preferred 

1 J embodiment of the invention. As shown in the figure, the system 300 includes a decoder 
□ 310, an estimator 320 and an encoder 330. Each of the component units of the 

transcoding system 300 may be embodied in hardware or as software installed in the 

memory of a computer. 

The input to the system 300 is the compressed video signal 302 that is a data 
stream and compressed by a first coding scheme. The decoder 310 decodes the input 
signal 302. During the decoding of a picture, the estimator 320 will gather the 
information 306 needed for estimating the signal characteristics of the cun-ent picture. In 
the prefen-ed embodiment, the estimator derives the cun-ent picture complexity using 
the input information and an estimation model. The output 308 of the decoder 310 is 
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either a fully reconstmcted picture or partially reconstructed picture depending on 
whether a full decoder or a partial decoder Is used. The decoder 310 also may or may 
not pass to the encoder 330 the coding parameters 304 used in coding the current 
picture by the first coding scheme. In the prefen-ed emlxxliment. the motion vectors 
and prediction mode and picture coding type of the cunrent picture are passed to and 
reused by the encoder 330 so that very significant computation in motion estimation can 
be saved to achieve low cost but high performance transcoding. The encoder 330 uses 
the signal characteristics 312 from the estimator 320 to decide a coding scheme for 
encoding the signal 308 of the cun-ent picture. The encoder 330 may also decide 
whether to reuse any of the coding parameters 304 of the first coding scheme if they are 
forwarded from the decoder. The output 314 of the encoder 330 is another data stream 
of the compressed video signal. 



Q Figure 4 is the diagram of a transcoding system 400 according to another 

Q prefen-ed embodiment of the invention. As shown in the figure, the decoder 41 0 fully or 

J partially decodes the input 402 depending on a full or a partial decoder is employed. 

W The look-ahead estimator 420 collects the information from both the input compressed 

i ; i 

ru video signal 402 and the decoder 410 to estimate not only the signal characteristics of 
f , the current picture but also the signal characteristics of the future incoming pictures. It is 

f y advantageous for the encoder 430 to know the signal characteristics of the future 

rn 

incoming pictures so that it can derive a better coding scheme for encoding the current 
Q picture. For example, knowing the complexities of the current and future incoming 
pictures, the encoder can allocate the bits to encode the cunrent picture wisely so that 
the over all visual quality of the reconstructed pictures can be optimized. The encoder 
430 detemnines a coding scheme based on the signal 412 for encoding the delayed 
signal of the cunrent picture 408. It may also decide whether to reuse any of the coding 
parameters of the first coding scheme. In the preferred embodiment, the encoder 
reuses the motion vectors, prediction mode, and picture type so that low cost high 
efficient video transcoding can be achieved. 
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Besides estimating the signal characteristics at the picture levels, the estimators 
in both Figure 3 and Figure 4 can also estimate the signal characteristics of each 
portion of the picture, such as each slice or macroblock level. It is also very 
advantageous for the encoder to know the signal characteristics of various portions of a 
picture so that it can derive a coding scheme accordingly. For example, knowing the 
complexities of each portion of the picture, the encoder can optimally allocate bits to 
encode each portion. In the preferred embodiment, the estimator also derives the signal 
complexity for each macroblock of the picture and the encoder uses the information of 
the macroblock complexity in coding the picture. 
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